Goto

Collaborating Authors

 contextual learning


T\"urk\c{c}e Dil Modellerinin Performans Kar\c{s}{\i}la\c{s}t{\i}rmas{\i} Performance Comparison of Turkish Language Models

Dogan, Eren, Uzun, M. Egemen, Uz, Atahan, Seyrek, H. Emre, Zeer, Ahmed, Sevi, Ezgi, Kesgin, H. Toprak, Yuce, M. Kaan, Amasyali, M. Fatih

arXiv.org Artificial Intelligence

The developments that language models have provided in fulfilling almost all kinds of tasks have attracted the attention of not only researchers but also the society and have enabled them to become products. There are commercially successful language models available. However, users may prefer open-source language models due to cost, data privacy, or regulations. Yet, despite the increasing number of these models, there is no comprehensive comparison of their performance for Turkish. This study aims to fill this gap in the literature. A comparison is made among seven selected language models based on their contextual learning and question-answering abilities. Turkish datasets for contextual learning and question-answering were prepared, and both automatic and human evaluations were conducted. The results show that for question-answering, continuing pretraining before fine-tuning with instructional datasets is more successful in adapting multilingual models to Turkish and that in-context learning performances do not much related to question-answering performances.


Beyond Interpretable Benchmarks: Contextual Learning through Cognitive and Multimodal Perception

DiSanto, Nick

arXiv.org Artificial Intelligence

With state-of-the-art models achieving high performance on standard benchmarks, contemporary research paradigms continue to emphasize general intelligence as an enduring objective. However, this pursuit overlooks the fundamental disparities between the high-level data perception abilities of artificial and natural intelligence systems. This study questions the Turing Test as a criterion of generally intelligent thought and contends that it is misinterpreted as an attempt to anthropomorphize computer systems. Instead, it emphasizes tacit learning as a cornerstone of general-purpose intelligence, despite its lack of overt interpretability. This abstract form of intelligence necessitates contextual cognitive attributes that are crucial for human-level perception: generalizable experience, moral responsibility, and implicit prioritization. The absence of these features yields undeniable perceptual disparities and constrains the cognitive capacity of artificial systems to effectively contextualize their environments. Additionally, this study establishes that, despite extensive exploration of potential architecture for future systems, little consideration has been given to how such models will continuously absorb and adapt to contextual data. While conventional models may continue to improve in benchmark performance, disregarding these contextual considerations will lead to stagnation in human-like comprehension. Until general intelligence can be abstracted from task-specific domains and systems can learn implicitly from their environments, research standards should instead prioritize the disciplines in which AI thrives.


Contextual learning is nearly all you need

#artificialintelligence

In another article, Sunghoon Kwon and colleagues show (Y. Lee et al. As explained by Faisal Mahmood and co-authors in an associated News & Views (G. In another research article in this issue, Mahmood and colleagues show another application of self-supervised learning: searching and retrieving gigapixel whole-slide images (Figure 1) at speeds that are independent of the size of the repository (C. To search for a tissue patch, rather than querying against every slide in the dataset, a variational autoencoder (a probabilistic generative model that learns latent representations of the data) is trained to represent select patches from each slide as a set of codes in a manner that the patches with the highest chances of matching the query can be retrieved by leveraging uncertainty-based ranking and a tree data structure for speed efficiency and scalability.


In a world where machines and AI rule, re-skilling is the only way out

#artificialintelligence

Gartner says more than 3 million workers across the world will have a'robo boss' by 2018. High time businesses reorient skill development programs to help mid-level managers stay relevant. In July, the Vodafone-Idea merger was approved by the Competition Commission of India (CCI). The mega deal will make the shareholders of both companies become part of the largest telecom company in India, and reward them in the future. It will also create a situation that can quickly escalate into a nightmare.